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Abstract 

We prove that every linear-activity automaton group is amenable. The proof is based 
on showing that a sufficiently symmetric random walk on a specially constructed degree 1 
automaton group — the mother group — has asymptotic entropy 0. Our result answers 
an open question by Nekrashevich in the Kourovka notebook, and gives a partial answer 
to a question of Sidki. 




Figure 1: A Schreier graph for a linear-activity group 



1 Introduction 

Automaton groups are the core of the algebraic theory of fractals. Just as fractals do in 
geometry, automata groups form a rich new world within group theory. This world has been 
a source of many interesting examples, but until recently it resisted a general theory. Yet the 
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simplicity of the definitions and the richness of examples suggest the existence of such a theory. 
The goal of this paper is to make a step in this direction by proving that all linear-activity 
groups are amenable. 

Automaton groups arise in various areas of mathematics; examples include: 



The Grigorchuk group, (see iGrigorchukl . Il984j ) w hich has faster than polynomial but subex- 
ponential growth, answering the question of iMilnorl (119681 ) about the existence of such 
groups. 

The basilica group, a finitely generated amenable group containing a non-cyclic free semi- 
group. Its level Schreier g raphs approximate the Basi lica f ractal, that is the Jul i a set of 
the polynomial z'^ — 1, see I Grigorchuk and ZukI ( 120021 ) and iBartholdi and Viragl (120051 ) . 



The Hanoi towers group, the group of possible moves in the Hanoi Towers game on three 
pegs, a game introduc ed by Edouard Lucas in 1883 . Its level Schreier graphs are discrete 
Sierpinski gaskets (see iGrigorchuk and Sunig . |2006| ) . 



The long-range group, an int e restin g group whose Schreier graphs (Figure ) were studied by 
Benjamini and HoffmanI ( 2005 ) in the context of long-range percolation theory. 



The lamplighter group on Z, the first example of a group with a Cayley graph that has 
discrete spectrum, p r ovidin g also a counterexample to the strong Atiyah conjecture. See 



Grigorchuk and ZukI (1200 ih 



Automaton groups are subgroups of the automorphism group Aut(T„ 
m-ary tree. Every element of Aut(Tm) can be decomposed as 



of the rooted infinite 



9 = {{9i 



0; 



,gm~i))cr, 



Qi e Aut(T^), a e Sym(m) 



where the Qi, called first-level sections, now act on the subtrees rooted at the children of the 
root of Tm, and a permutes these subtrees. An automaton A is a finite subset of Aut(Tm) 
so that first level sections of elements of A are also in A. An auto maton group is a group 
generated by an automaton. See, e.g., ISunic and GrigorchukI (120071 ) and references there for a 
general survey on automaton groups. 



A systematic study of automaton groups was initiated by ISidkil ( l2000l ). who introduced 
the concept of activity growth (or more briefiy, activity), a measure of size for automaton 
groups. We defer the precise definition for later, but note that the activity can be polynomial 
of any degree or exponential. There are exponential ac t ivity growth automaton groups tha t 



are isomorphic to the free group (see iGlasner and Mozed . 
However, one expects polynomial automaton groups to 
with all known finitely generated non-amenable groups, 



2005 



Vorobets and Vorobets 



2003). 



3e smaller. In particular, in contrast 



Sidkil ( I2OOOI ) showed that polynomial 
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automaton groups have no free subgroups. In light of this dichotomy, Sidki asked the following 
natural question. 

Question. Are all polynomial activity automaton groups amenable? 

It seems that the structure of polynomial automaton groups depends a lot on the degree, 
and we do not have a conjectured answer to Sidki's questio n. The first result in this direction is 
due to iBartholdi. Kaimanovich. and NekrashevychI (120081 ). who showed that all degree (also 



called bounded) automaton grou ps are amenable. The i r proo f uses a variant of the self-similar 
random walk idea introduced in IBartholdi and Viragl (I20051) to proy e the amenability of the 



basilica group and later streamlined and generalized in iKaimanovichl (120051 ). The goal of this 
paper is to show the following. 

Theorem 1. All linear- activity automaton groups are amenable. 

Example. A simple example of our theorem is given by the long-range group: Let b act on 

the integers by increasing them by 1, and let a act on integers by increasing them by the lowest 
power of 2 they are not divisible by. The group generated by a, b is called the long-range group; 
its Schreier graph is shown in Figure . Its automaton has two states defined recursively by 

a=((a,6)), 6=((6,1))(01). 



The question of the amena bility of the long-range gr oup was posed by Nekrashevich, in 



the 



Kourovka notebook (2006)1 . Question 16.74, and also in lGuido's book of conjectures (2008) 



Conjecture 35.9. 

Our pape r build s on previous work in the t heory, including IBartholdi and Virag (120051 ) . 
KaimanovichI ( 20051 ) . and IBartholdi et al.l ( 20081 ). The first part of the proof is to construct a 
family of linear-activity automaton groups — the mother groups — which are then shown to 
co ntain subgroups isomo rphic to every linear-activity automaton group. This has been done 



m 



Bartholdi et al. 



(120081 ) for bounded automaton groups. We show how to perform this step 
for polynomial automata of any degree. It then suffices to show that the linear-activity mother 
groups are amenable. 

The second part is to find a random walk on the mother group with asymptotic entropy. 
We can only do this for linear-activity mother groups. In fact, we conjecture that there is a 
phase transition for this question, namely such walks only exist up to degree 2. 

Conjecture 2. The mother group of degree d has a random walk whose step distribution is sup- 
ported on a finite generating set and whose asymptotic entropy is zero if and only if d = 0, 1, 2. 

For d = 1 this follows from this paper. For all d > 2 this conjecture is still open. 
Compared to previous work, the random walks we consider are no longer self-similar or 
even finitely supported. The role of self-similar walks is played by the evolution of random 
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walk step measures that we can control. For such control, we need a high level of symmetry 
for the random walk. However, we believe that this is an artifact of the proof. 



Conjecture 3. Every random walk with houndedly supported step distribution on a polynomial 
automaton group with degree at most 2 has zero asymptotic entropy. 

This conjecture is open even for = 0. 

The main tool for the analysis of asymptotic entropy is study of the so-called ascension 
operator. Consider a random walk walk with step distribution /i on an automaton group, 
and let v be the first child of the root of T^. The section of the walk at f , looked at the 
times at which the walk fixes v forms another random walk with step distribution /i'. The map 
T : fi \^ fi' is called the ascension operator (see furt her details in Section Bl). 

Ascension of random walks first con sidered in Bartholdi and Viragl (120051 ). The ascen- 
sion operator in this form appeared in iKaimanovich J20051 ). where the entropy inequality 



HooifJ') < HooiTji) was also proved. Iterating this inequality, one gets ifoo(/^) < -f^oo(^"/^)- 
In this paper, we will analyze T"/i in the case when these measures are not finitely supported 
and not computable exactly. The following proposition allows us to relate Hoo{fi) to the asymp- 
totic entropy of limit points of the sequence of measures T"/i. 

Proposition 4 (Upper semi-continuity of asymptotic entropy). Ifi'n, ^ o-^e probability measures 
on a countable group so that i/„ — z/ weakly and H{vn) — > H{v), then 

limsup(ifoo('^n)) < Hoo{y). 

n— >oo 

In light of Proposition |U it suffices to find a subsequence along which T^^ — > v and 
H{T'"'fi) —>■ Hlu), and u has zero asymptotic entropy. Perhaps surprisingly, it is not too difficult 
to show that for appropriate /i, any subsequential limit point u has zero asymptotic entropy. 
For the other two claims, it suffices to show the tightness and entropy-tightness (defined below) 
of the sequence T"/x. Proving these facts takes up a large part of this paper. Showing the 
tightness of the sequence T"/i is the main obstacle to extending our proof to the higher degree 
case. For convenient reference, we summarize the preceding discussion in a theorem. 

Theorem 5 (Asymptotic entropy of automaton groups). If the group generated by the support 
of fi acts transitively on all levels of the tree and the sequence T^fi is entropy-tight, then for 
every subsequential weak limit point v we have 

The main challenge is to construct measures for which the ascension operator is tractable. 
The measures that we consider are based on the uniform measures on certain finite subgroups 
of the mother group 2Jl. They have the property that if a probability measure // on OJt is a 
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convex combination of convolution products of the g^'s, then so is Tjj,. Thus the iterated 
ascensions T^^ of such measures can be understood in terms of the g^'s, which are sufficiently 
symmetric and can be controlled. Further details can be found in Section [61 

Organization. The structure of this paper is as follows. Sections [2H1] contain the setup, 
review and proof of some preliminary results; readers familiar with previous work on the subject 
may want to skip these sections. Section [2] reviews basic concepts related to automata groups. 
In Section[3]we do the same for entropy, and prove Proposition|H In SectionlHwe introduce and 
study the ascension operator. Mother groups are defined in Section [5], and it is shown that they 
contain all polynomial automata groups. In Section [6] we introduce a special class of measures, 
called patterns, and an algebraic way to study them. We also define an ascension operator 
for patterns. In Section [7] we study properties of iterated ascension on patterns. Sections [8] 
contains some preliminary results on entropy of pattern measures, and finally, the main theorem 
is proved in O 

2 Automata and their groups 

Basic definitions. Finite automata are the simplest interesting model of computing; we first 
connect our definition of automata to the more traditional one. 

The space of words in alphabet {0, . . . ,m — 1} has a natural tree structure, with {wx^xKm 
being the children of the finite word w, and the empty word being the root. Let denote 
this tree. 

A finite automaton on m symbols is a finite set of states A with a map A A"^ x Sym(m) 
sending a i— >• (oq, . . . , Om-i, o"a)- We will use the notation 

a = ((ao, . . . , am^i))aa. 

An automaton acts on words in alphabet {0, . . . , m — 1} sequentially. When the automaton 
is in a state a and reads a letter x, it outputs x.aa and moves to state a^. From this state the 
automaton acts on the rest of the word. Symbolically, for a word xw (starting with a letter x) 
we have the recursive definition 

{xw).a = {x.cra){w.ax). (1) 

The first k symbols of the output are determined by the first k symbols read. Note that the 
action is defined for for both finite and infinite words, and that the action on infinite words 
determines the action on finite words and vice versa. It follows that each element a G A is 
an automorphism of T^. The automaton group corresponding to an automaton A is the 
subgroup of Aut(Tm) generated by A. 
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The action ([T]) corresponds to the following multiplication rule: 

((ao, . . . , ad))a{{bo, bd))r = ((ao&o.a, • • • , am-i&(m-i).a))cr^- 

This multiplication rule can be used to define automaton groups without any reference to 
automorphisms of the tree. However, keeping the action on the tree in mind makes some 
constructions used in the proof more natural. 
We use the conjugation notation = b^^ab. 

The notion of first-level sections can be generalized to any level. If f G is a finite 
word and g G Aut{Tm), then there is a word v' of equal length to v and an automorphism 
g' G Aut(Tm) such that vw.g = v'{w.g'), for every word w. This g' is called the section of g 
at V. Informally, g' is the action of g on the subtree above the vertex v. The section of g at v 
is denoted g{v). 

Activity growth of automaton groups. For any state a G A, the number of length-n 
words V such that the section a{v) is not the identity satisfies a certain linear recursion. Thus 
this number grows either polynomially with some degree d or exponentially. We define the 
degree of activity growth (in short, degree) of a to be d or oo, respectively. The degree of 
an automaton group is the maximal degree of any of its generators. Automaton groups are said 
to have bounded, linear, polynomial or exponential activity growth when their degree is 
0, 1, finite or infinite, respectively. 

Degree and cycle structure. An automaton gives rise to a directed graph where there is 
a directed edge from a state a to each of its first-level sections a{i). If the same state appears 
more than once as an then there are parallel edges. If a appears as a first-level section of 
itself (i.e., a{i) = a) then there is a loop at a. Thus the only edges leaving the identity (if it is 
a state of A) are loops back to the identity. The loops at the identity are called trivial cycles. 
To avoid degeneracies in the graph, we will assume from now on that all non-identity states in 
the automaton act non-trivially. This assumption, which does not limit the generality of the 
automaton groups considered, allows some connections between the structure of the said graph 
and properties of the automaton. 

The number of active vertices of a at level n is just the number of directed paths of length 
n starting at a and ending anywhere but the identity. It follows easily that an automaton is 
exponential if and only if there are two nontrivial cycles so that each is reachable from the 
other via a directed path. 

The nontrivial directed cycles in the directed graph of a polynomial automaton have a 
partial order: ci < C2 if Ci 7^ C2 and there is a directed path from a state of C2 to a state 
of Ci. Define the degree of a cycle c as the maximal n so that there is an increasing chain 
Cq < Ci < . . . < Cn = c. It is straightforward to see that the degree of a polynomial automaton 
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is the maximal degree of any cycle in its directed graph. The degree of a state g is the maximal 
degree of a cycle reachable via a directed path from g. The identity 1 is always considered to 
have degree -1. 

Hierarchy levels. For polynomial automaton, the hierarchy level of a state is a refinement 
of its degree, taking values in the sequence 



States in cycles of degree d have hierarchy level d. States of degree d that are not in a cycle 
have hierarchy level d. A state can point to states either in its own hierarchy level or lower 
levels. The only state with hierarchy level —1 is the identity. 

Collapsing levels. By dividing words in m symbols into blocks of length k, we can view them 
as words in symbols. Similarly, given an automaton A acting on m symbols, it naturally 
gives rise to an automaton with the same states acting on words in m*^ symbols; we call this 
new automaton the fc-collapse of A, and it acts on the tree T„fc. 

Claim 2.1. The automaton groups of A and of its k-collapse are isomorphic. 

Proof. The key is that the the vertices of T^fe are naturally associated with every /c'th level 
of Tm, and that Aut(Tm) C Aut(T^fe). Restricting this embedding to G = (A) gives an 
isomorphism from the automaton group of A to the group of the fc-collapse of A. □ 

3 Entropy and asymptotic entropy 

The purpose of this section is to review the notion of entropy and to prove Proposition l3.3l below. 
which gives a condition for upper semi-continuity of the asymptotic entropy of a random walk 
on a group. 

Recall that the entropy of a finite non-negative measure fi supported on a countable set G 
is defined by 



where by convention OlogO = 0. The entropy H{X) of a discrete random variable X is given 
by the entropy of its distribution; the entropy H{Xi, . . . X„) of more random variables is given 
by the entropy of the joint distribution of the Xj. In order to define conditional entropy for 
two random variables X, Y, let f{y) denote entropy of the conditional distribution of X given 
Y = y. Then the conditional entropy of X given Y is defined as H{X\Y) := E/(F). 



-1,- 



1,0,0,1,1,2,2,... 
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The conditional entropy satisfies 



H{X,Y) = H{X\Y) + H{Y). 

A useful and easy fact is that among measures supported on a given finite set, the one having 
maximal entropy is the uniform measure on that set. Thus for a measure n supported on a 
finite set B we have H{fi) < log(|i?|). It is not hard to construct measures on Z with finite 
expectation and infinite entropy, but it is known (see, for example, iKempl . 119971 ) that among 
integer- valued random variables with given mean and variance the discrete normal distribution 
maximizes entropy. In particular, the following holds. 

Lemma 3.1. There is a constant c such that for any integer-valued random variable N we have 

H{N) < C + (logVarA^)+, 

where = max(0, x). 

Define the asymptotic entropy of a sequence of random variables Xn as 

limsup —H{Xn). 

If a random walk on a group G has i.i.d. steps with distribution given by /i, its asymptotic 
entropy is given by 

ffoo(/i) = lim 

where /i*" is the n-fold convolution of yU, or, equivalently, the distribution of the n'th step of 
the random walk. The limit exists by subadditivity. 

We say that a set of probability measures {yUj} is entropy-tight if for every e there exists 
a finite set K so that for all i 

-f^i{x) log ^ii{x) < e. (2) 

In other words, entropy-tightness means the uniform integrability of the function log 
with respect to counting measure. The importance of entropy-tightness comes from the follow- 
ing direct application of Vitali's convergence lemma. 

Lemma 3.2. Assume that /in weakly. Then H{fin) — ^ if and only if the sequence 

fin is entropy -tight. 

The aim of this section is the following result: 

Proposition 3.3. If fii ^ fi and {/ij} is entropy-tight, then limsup ifoo(/^i) < -f^oo(/^)- 

We begin with two lemmas. 
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Lemma 3.4. For any two measures /i and v we have H{fi * u) < |z/|iJ(/i) + |/i|iJ(z/). 
Proof. For the product measure /i x z/ we have 

xu) = ^^{x)iy{y) log(/x(x)z/(?/)) 

= \fi\H{u) + \u\H{fi). 

Now by subadditivity of the function — xloga; we have H{fi * u) < H{fi x u). The lemma 
follows. □ 

Lemma 3.5. // a family of probability measures {/ij} is both tight and entropy-tight then so is 
the set of all their n-fold convolutions {^i^ * ■ ■ ■ * 

Proof. It suffices to prove this for n = 2, with larger n following by induction. For a measure 
/i we denote the measure restricted to A, without normalization, by /i^. 

Fix some e > 0. The conditions imply that there is a finite set K such that for all i, 

i/zn=/i,(ir^)<e, H{^^r)<e. 

For any two measures /i, u from the set we have 

Consider the set finite set B = K * K, and note that the support of /i^ * is contained in B. 
It follows that 

which can be made arbitrarily small, hence the convolutions form a tight family. 

Note that for a, 6 > we have —a log a — 6 log 6 > — (a + 6) log(a + 6), so that entropy of 
measures is subadditive. Note also that entropy-tightness implies that for some M, for every i 
we have H{fii) < M. 

It follows using Lemma [3.41 that 

H ((^ * uf') < H (/x^^ * z/^ + /i^ * u^' + /x^^ * u^') 
< 2Me + 2e + 2e^. 

Since this too can be made arbitrarily small, the convolutions are also entropy-tight. □ 
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Proof of Proposition \3. 3[ Fix some n. By Lemma 13.51 we have that are entropy-tight, 

and so by Lemma 13.21 we have 

H{f,*-) = Mm Hif^r). 

J— >oo 

The sequence H{fi*"-) is subadditive, and therefore 

n 

Taking the hm sup of both sides we get 

hmsupifoo(/ii) < -hmsupif(/i*") = -H{fi*''). 

i — ^oo ^ i — ^oo ^ 

Since n is arbitrary, we can take a hmit as n — oo to conclude the proof. □ 



4 Random walks and the ascension operator 

In our proof we use the method, introduced by iBartholdi and Vira of studying a 

random walk on an automaton group by looking at its induced action on the subtree above a 
vertex v, specifically at times n at which X„ fixes v. 

Random walks on quotients and subgroups. Let {Xn}n>o be a random walk on a count- 
able group G started at Xq = 1. 

If is a normal subgroup of G, and K = G/N, then X has a natural projection to K, 
namely Yn = NXn- We call this the quotient random walk on G/N. 

A set 5 C G is called recurrent (for a random walk X) if X visits S infinitely often with 
probability one. For example, every finite index subgroup is recurrent in G. For a recurrent 
subgroup 5* the steps at which the random walk visits 5* form a random walk Y on S (i.e. if r„ 
is the nth visit to S, let Yn = X^^). We call this the induced random walk on S. 

The ascension operator. Given an automaton group A and a vertex j G T^, consider the 
stabilizer subgroup Aj of j. Consider also the subgroup A'j which is the stabilizer of the entire 
subtree above vertex j. Then A'j is normal in Aj. For g G Aj the coset gA'j consists of all 
elements with the same action g{j) on the subtree above j. Since A is an automaton group, 
g{j) G A, hence the group Aj/A'j is therefore canonically isomorphic to a subgroup of A. 

We now specialize to vertex j = in the first level of T^. Given a random walk on A with 
step distribution /i, we can consider the induced walk on Aq, and then its quotient walk on 
Aq/Aq. By the above, this again can be viewed as a random walk on A with step distribution 
yu'. The ascension operator T is the operator that maps each probability measure fi on A, 
to the measure fi' above. If r„ are the times at which X^-^ fixes vertex 0, then the actions of 
Xt-^ on the subtree above are a random walk with step distribution Tfi. 
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We say that a measure /i is transitive on level k if the group generated by its support acts 
transitively on that level of T^- We will use the following entropy inequality. 



Theorem 4.1 (IKaimanovichl (120051 ). Theorem 3.1). Assume that a probability measure fi is 
transitive on the first level. Then the asymptotic entropies satisfy Hoo{fi) < HooiTfi). 

The upper semi-continuity of asymptotic entropy, Proposition H] yields the following. 

Theorem 4.2 (Asymptotic entropy of automaton groups). If the group generated by the support 
of fi is transitive on all levels and the sequence {T'^fJ.} is entropy-tight, then for any subsequential 
limit point v we have 

Proof. The transitivity of implies that T^^ is transitive on the first level for all k. Repeated 
application of Theorem 14.11 shows that for each k 

HM < i/oc(TV)- 

Taking limsup along the subsequence converging to u and using Proposition 13.31 gives 

Hooifi) < limsup//oo(rV) < H^{iy). □ 



5 Mother groups 

The (m, r)-mother group, denoted Tlm,r, is defined as the automaton group generated by the 
following states 

ttfc.a = ((ttfc.a, ttfc-La, 1, . . . , 1)), < A; < r, 
tt-l,(T = cr 

I3k,p = {{f3k,p, I3k-i,p, 1, . . . , 1)), l<k<r, 
f3o,p = {{f3o,p, 1, • • • , 

where <J, p E Sym(m) are arbitrary, subject to O.p = 0. The number of states in OJl^.r as defined 
here is m!(r + 1) + (m — l)!r. The same group can be generated with fewer states by taking a, p 
only in a minimal, (2-element) set of generators of Sym(m) and stab(O) C Sym(m) respectively. 
This would give a generating set of size 4r + 2. After this section m will not play an important 
role, so we also denote the group by OJt^. 

The actions of au^a and [3k,p on a word have simple descriptions. Both read the word and 
make no changes up to the A; + I'th non-zero letter. 

• If the first k + 1 nonzero letters in a word are all 1, then Uk^a permutes the next letter by 
cr. Otherwise it does nothing. 
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• If the first k nonzero letters in a word are all 1, then f3k^p permutes the next nonzero letter 
by p. Otherwise it does nothing. 

Thus both affect only the k + Vst non-zero letter and the letter immediately following it. 

Note that ak^a and Pk,p both have self-loops and are of degree k, so they have hierarchy 
level k. 

Theorem 5.1 (Mother groups contain all). Every degree-r automaton group is isomorphic to 
a subgroup ofDJlrn,r for some m. 

Note that m is generally not the same as the degree of the tree on which the automaton 
acts. 

Proof. We prove that there exist m', m" so that the each of the following groups can be iso- 
morphically embedded in the next: A C Gm'.r C ^Jt^, ^ C OJtm",r- The intermediate groups are 
defined below. The three containments are proved in Lemmas 15.21 15.61 15.71 □ 

The first reduction 

For each hierarchy level h G { — 1,— 1,0,0,1,1,2,2,...} we define an automaton group as fol- 
lows. = {!}, and Gm,-i = Sym(m). For any r > the group Gm,f is obtained by adding 
to Gm,r-i all possible states of the form: 

g = {{go, ■ ■ ■,gm-i))o- 

(with a G Sym(m)) satisfying the following. There is a unique i such that gi = g, and for j i 
we have that gj is a state of Gm,r-i- 

Similarly, for r > we define Gm,r by adding to Gm,f all possible states of the form: 

g = {{go, ■ ■ ■,gm-i))o- 

where a G Sym(m), and gj is a state of Gm,f for all j. 

Since they are larger than the mother groups, and are also predecessors of the mother 
groups, in this section we will refer to the groups G„i,h as grand-mother groups. 

Lemma 5.2. Let A he an automaton group with all states of hierarchy level at most h. Then 
for some k, the k-collapse of A is isomorphic to a subgroup of G^k^h 

Proof. Let A; G Z be a multiple of all cycle lengths of A and larger than the length of any 
simple directed path in A. The /c-coUapsed version A' of A has the property that all of its 
cycles are loops. States with loops have a single loop only, since otherwise A would have been 
exponential. 

A state of a given hierarchy level can only point to a state from a lower level, except for 
states with loops that also point to themselves. By construction, the grandmother group Gm',r 
contains all possible states of this form. □ 
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Extended mother groups 

Let Wk denote the finite subgroup of Aut(TTO) for wliicli all sections at level k are the identity. 

Definition 5.3. The extended mother groups SDTj^ ~, OJlJ^^ are defined the same way as 
the ordinary mother groups ^Xflm,r, except that a ranges over all elements of Wr+2 and Wr+3, 
respectively (the options for p are unchanged). 

The extended mother groups are nested: 

Moreover, we have the following lemma. 
Lemma 5.4. If go, . . .,gm-i e ^*m,f, then 

{{90, ■ ■ ■ , 9m-l)) e ^*m,r- 

Proof. By taking products, it suffices to prove this for the case when all Qi are the identity 
except for one, which is a generator of 9Jt^^. Moreover, since Sym(m) C 9?l^r; ^y conjugating 
by a transposition r G Sym(m), we may assume that the non-identity entry is go- 

We now prove the claim by induction on the degree of go- Consider first elements of type 
a- For go = a_i,cr = cr this holds by the definition of OKj^^ (this is the reason for the choice of 
in the definition). For higher degree elements, by definition 

Olk,a = {{0lk,a: Olk-l,a: 1, ■ ■ ■ , l)), 

SO 

{{ak,a, 1, • • • , 1)) = «fc,<7((l, ttfc-l,<7, 1, • ■ • , 

Now, ak^a G ^*n,f 3Jt*„ and the induction hypothesis implies ((1, afc-i,^, 1, . . . , 1)) G ^m,r 
as well, hence so is their product. 

The proof for type /? elements goes the same way. □ 

Lemma 5.5. If gi, . . . , gm G ^m,r-i) '^'^^ P ^ Sym(m) has O.p — then 

a = {{a,gi, . . .,grn))p e DJl*,^^^- 

Proof U g is a state of DJl^ then by definition a = {a, g,l, . . . ,1) G 37t*^ ~. Multiplying such 
states shows that this is true for any g G ^m,r-i- Define = (cj, gi,l, . . .). We have 

where (Ij) is the transposition, (ij) and p are in OJl^ ~ since they are of type Pq- Thus a G 9Jt^ jr 
as claimed. □ 
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The key step in the following construction is a conjugation due to iBartholdi et al.l (120081 ). 
where it was used for bounded automaton groups. Consider the automorphism 



where 7 is the cyclic shift (012 ... m — 1). 

Lemma 5.6. For every hierarchy level h the conjugated grandmother group G^f^ is a subgroup 
of the mother group DJl^ j^. 

Proof. The proof proceeds by induction on the h. On each level, it suffices to show this for the 
group generators. For hierarchy level —1 we have 

= ((70-o-,...,7— 

is an element of W2 = -i- 

Assume the lemma holds for hierarchy level r — 1. Let g = {{go, . . .gm-i))o' be a generator 
of Gm,r- Assume that gk = g is the loop at state g. We need to prove that g^ is an element of 
0JIJ„ ~, for which it suffices to show that this holds for h = ■y^g^'j~^-'^ . We have 

h = {{ho, /i^_i))7V7-'-^ where = Y+'^?f+,7-^'+'^-^ 

Now Hq = h, the hi, h2, . . . are in ^M^^_l, and ■y'^a^y''''" fixes 0. Thus by Lemma 15751 we have 
that h G Tl*^ ~. 

To get from hierarchy level r to hierarchy level r, suppose that g = {{go, . . •5'm-i))c where 
gi are generators of DJl^ ~. Then 

by the inductive hypothesis and Lemma 15.41 □ 
Embedding in dJl 

The extended mother groups contain the ordinary mother groups, but there are also embeddings 
in the other direction, as the following lemma shows. 

Lemma 5.7. The r + 3-collapsed version ofdJl^^. is isomorphic to a subgroup 0/ OJl-^r+a 

Proof. It suffices to show this for generators. Clearly, the r + 3-collapse of any element in Wr+3 
is just an element in Sym(m''"'"^). The r + 3-collapsed version of any other state of type a or /3 
is of the form 

« = {{a,9i, ■ ■ ■,9m'-i))p 
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where p G Sym(m^"'"^) and O.p = 0, and the QiS are states of lower degree than a. Just as in 
Lemma 15.51 we define from gi by 

ai = {{ai,gi, !,...,!)) 

and we note that Oj is a state of OJl^r+s Any p with O.p = is just Pq, p, and so is also in 9Jl. 
Thus we can conjugate by the transposition (li) and write 



Level subgroups 

Observe that the group of automorphisms of the first two levels of fixing and its children 
is isomorphic to Sym(m) I Sym(m — 1). 

For each a G Sym(m) I Sym(m — 1) and each word w in the symbols {1, . . . , m — 1} let X^^a 
denote the element of Aut(T) acting as follows: If the first \w\ nonzero letters agree with w, 
then Xw,a permutes the \w\ + 1st nonzero letter and the following letter by cr. Otherwise Xw,a 
does nothing. 

For a word w of length k, define the group generated by X^^a as a ranges over Sym(m) I 
Sym(m — 1). Define the group £rn.,fc to be the group generated by the £1^^ for all words w of 
length k. Define further 2m,-i = Sym(m). 

Later, we will consider random walks on the mother group whose step distribution is a 
convex combination of uniform measures on the subgroups £m,fc for various /c's. 

Lemma 5.8. For each w, ~ Sym(m) I Sym(m — 1). The group £,m,k is a subset ofTlm,k 
and is the direct product of ilj^ ^, for w E {1, . . . ,m — 1}'^. 

Proof. The structure of follows from Xw,aXw,a' = A^^o-o-'- Generators corresponding to 
different words w of equal length commute, hence is the direct product of the -C^,*:'^- 

For w = 11 ... 1 of length k we have that S^^ ^ generated by the Pk,p and ak,aS, hence it 
is a subgroup of Tlm,k- 

Now let w he a general word of length k, and note that the automorphism 



changes only the first k non-zero letters, and if they are all 1 it changes them to the letters of 




as an element of DJlmr+a^r- 



□ 



b — f3k-i,{iwk) ■ ■ ' Po,{iwi) 



w. Thus is the conjugate by b of £m fc^; therefore £^ C ^m,k for any w. 



□ 
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6 Patterns 



The random walks on the mother groups that we consider below have a step distribution that 
is a mixture of the uniform measures (ji on the finite level subgroups and their convolutions. 
It is convenient to think of the measures as elements of the group algebra MQKa;. 

The elements G M.dJlk satisfy the relations qf = q^ (being uniform measures on subgroups), 
and possibly some other relations that are less tractable. Therefore we introduce the more basic 
semi-group (with identity 0) Qk given by the generators . . . , g^, (which we call pattern 
letters) and the relations qf = qi. Elements of Qk are called patterns. We further define 
-D = -Dfc C Qfc to be the set of patterns that contain the letter 

Since the measures satisfy all relations satisfied by the qi, the map qi ^ qi extends 
multiplicatively to a unique semi-group homomorphism M9Jlfc. The image p of a pattern 

p is called the evaluation of p. 

Each pattern is an equivalence class of words in the letters {q'-i, Q'o, • • • , ?fc}- The equivalence 
relation is that repetition of a symbol is equivalent to a single instance. For example, for a, 6, c 
in the set of letters, we have abaacaaabba = abacaba . Composition is concatenation. The 
length of a pattern p, i.e. the length of the shortest element of its equivalence class is denoted 
\p\. The set Qo is of particular interest to us. It contains patterns in two letters, which must 
alternate. Thus Qo has only two patterns of any length. Finally, for a measure /i on patterns 
and a >0 the ct-moment of the length is denoted by 



The main reason for the definition of patterns is that they have behave nicely with respect to 
the ascension operator. To make this statement more precise, consider a probability measure 
II supported on patterns. is a convex combination of measures concentrated on a single 
pattern, and is naturally an element of IRQ^. Its image JI under the quotient map is an element 
of MSJlk, or more precisely, a probability measure on 971^. The measure JI is will also be called 
the evaluation of /x. It is given by the formula 



peQk 

Probability measures of the form JI are special. Nevertheless, this class is preserved by the 
ascension operator T. This suggests that the ascension operator can be defined on the level of 
patterns. 

The ascension formula 

Let {Zn} be the random walk on G, with steps X„, so that Z„ = Xi ■ ■ ■ Xn, and write X„ = 
((X„(0), . . . , Xn{m — l)))cr„. Clearly the trajectory of 0, is itself a Markov chain: If J„ = O.Z„ 



peQk 
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then Jn = J„_i.o"„. The section Zn{0) is not itself a Markov chain (except for very restricted step 
distributions). However, the pair (J„, Zn{0)) is a Markov chain, since Z„(0) = Z„_i(0)X„( J„_i): 
to determine the action of Z„ on the subtree above we need to know the action of on 
that subtree, and a single secti on of X„, wit h index given by O.Z„_i. 

A key observation used by iKaimanovichI (120051 ) is that if we consider the process {Z„(0)} 
conditioned on {Jn} we get a process with independent increments. Moreover, if Z„(0) = 
Yi - ■ ■ Yn, then Y^ depends on { J„} only through the pair J„_i, J„. Indeed the law of Yn is the 
distribution of Z„(J„_i) conditioned on J„_i.(T„ = J„. 

Such a process can be described naturally by the mxm matrix M with entries in the group 
algebra MG, with entries 

M,,=EX{{)l{,,x=j}. (3) 

Here X e 971^ is a step of the original random walk, and therefore the section X{i) is also in 
the group. To take its expectation consider X{i) as an element of the group algebra MSJlfc. 
Note that the entries of this matrix are interpreted as subprobability measures. Their total 
mass ||Mjj||i are the transition probabilities for the Markov chain {Jn}- A useful consequence 
of this definition is that for /i = yUi * we have M = M1M2. 
For an m X m matrix M, consider its block decomposition 



M 



Moo 


Mo, 


M,o 





so that M^,* is an (m — 1) x (m — 1) matrix. 
Proposition 6.1. Let M be as above, then 

Tfi = Moo + Mo,{I - M„)-^M,o, 
where (J — A)~'^ means I + A + + ■ ■ ■ . 



This appears as Theorem 2.3 in IKaimanovichI (120051 ). Note that M,^, is an (m — 1) x (m — 1) 
matrix with entries in MG. We include the above discussion and the brief proof because similar 
ideas are used next for measures on patterns. 

Proof. With the above notation, consider the trajectory of until the first time r such that J^ = 
0. The probability of a particular trajectory = Jo, Ji, ■ ■ ■ , Jr = ^ is a product of transition 
probabilities ||Mjj||i of the J Markov chain. Conditioned on these values, the process Yn is a 
product of independent samples from the probability measures Mjj/||Mjj||i corresponding to 
the transitions. Paths of length r correspond to the term Mo*MJ"~^M=ko, and r = 1 gives the 
Moo term. □ 
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Ascension formula for patterns 

Suppose now that /i is a probability measure on Q^, and consider the random walk on DJlk 
with step distribution JI. This measure has a great deal of symmetry. A uniformly chosen term 
X G 2.-1 sends any z to a uniform vertex on level 1 of the tree, and has trivial sections X{i). An 
element X G ilj for i > stabilizes the first level of the tree, and its sections are X{0) = X and 
X(i) G £ again uniformly. Similarly, an element X G £0 fixes 0, permutes {1, . . . ,m — 1} 
uniformly, and its sections are X{0) = X and X{i) G again uniformly and independently. 

Thus the matrix M defined above is as follows: 

• For £ > we get a diagonal matrix, with terms Mqo = ql and Ma = qi-i. 

• For g_i it is the constant matrix Mjj = 1/m (entries are the measure of mass 1/m on the 
identity element of the group). 

• For we have Mqo = g-i, and Moi = Mjo = for i 7^ 0. The remaining minor has 
constant entries: Mij = for i,j > 0. 

Note that in all cases, all entries Mij are in MQ^. 

Next, consider a measure p for a pattern p = Qai ■ ■ ■ Qar Since p is a convolution of g^, the 
resulting M is a product of the corresponding M's, and also has entries in MQ^.. Finally, from 
(13]) we see that M is linear in the step distribution. Since /I is a convex combination of p, we 
see that JI also gives M with entries in MQfc. 

Further more, note that for all qe we have that the following sets of entries are all constant: 

• the first row, except Mqo; 

• the first column, except Mqo; 

• the main diagonal, except Mqo; 

• Mij for i ^ j and i, j > 0. 

Matrices with these properties form an algebra, and therefore the same identities hold for any 
JI. This allows us to simplify some subsequent computations by using the 2x2 matrices R 
defined by 

Rqq = Mqo, -Roi = Mqj, 

j>0 
j>0 

where M is defined by ([3]) for a sample of Jl. (Taking any z > in place of 1 will not make a 
difference.) Since M \—>- R is a homomorphism of matrix algebras, /i t— > i? is a homomorphism 
from MQfc to the matrix algebra of 2 x 2 matrices over MOJlfc: if /i = yUi * yU2 then R = R1R2. 
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We now introduce the operators TZij, so that for any measure fi on we have RijJI = TZijfi. 
Since i— >■ i? is a homomorphism, it suffices to define TZ for patterns of a single letter, and 
extend it by multiplicativity and linearity to all of MQa:- It is clear that the following definition 
satisfies this desire. For k > 0, define 



Qk i = j = 0, 
Qk-i i = j = 1, 
t^j. 



Note that A; = no longer needs special care (an advantage of using R over M). For k = —1 
define 

^ - J = 0, 



Tlijq-i 



J = 1- 

m. 



It follows that the total mass of TZiju for any measure on patterns is given by 

n^AQk) = i.=,(i - i^m + u{dY- ^, (4) 

where D is the set of patterns containing the letter Let TZ^ to be the matrix with entries 
IZijii. As noted above, for a pattern p = g^^ ■ • • we have, using matrix multiplication. 
Tip = Yl^Qcti- For measures /i we define TZfi = J2pf^ip)'^P- 

A key consequence of this construction is that a version of Proposition 16.11 holds with TZ. 
Define the pattern ascension operator by 

Tfi = TZoofi + (7^ol/i) (l - Tliifi) \7^lo/i), (5) 

where (1 — a)^^ = 1 + a + + ■ ■ ■ . 

Proposition 6.2. For a probability measure fi G M.Qk we have TJl = Tfi. 



Proof. Let M be the matrix corresponding to /i. By Proposition 16. H 



Tji = Moo + Mo, (/ - M„) ^M,o. 

However, since M is in the aforementioned matrix algebra, it is straightforward to verify that 
R^iR'l^Rio = Mo^M^M^o, and hence 

Tji = i?oo + Roi — Ru) Rio- 

Since RJI = TZfi, this implies the proposition. □ 

We now inspect the structure of TZ in more detail. Since TZ is multiplicative, for some 
patterns its entries are delta measures: 
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Lemma 6.3. For a pattern p = Qa^' ' ' Qat that does not contain the letter we have 



TZijp = < 



p i = j = 0, 

t^J. 



For a general pattern with decomposition p = PoQ-iPi ■ ■ ■ q^iPe into patterns Pi G D'^ we have 

^ / 1 m-l \ 

i=\ ^ ' 

Proof. Since TZ is multiplicative, the first part holds by definition. The second part follows 
from the first and from TZijQ-i = {m — ly /m. □ 

Define the averaged legacy operator 



M6{0,1} 



m 



so that Afi = ^^7^) ("^Z^) ■ Note that when is a probability measure, so are TZijfJ., 
and Afj.. 

Lemma 6.4. Let p = po q-i pi q-i ■ ■ . q-i Pk where pi G D'^ he the decomposition of p into 
suhpatterns not containing q_i (possibly Pq or p^ are 0j. Then 



Ap= iApo)---iApi). 

Proof. This follows from the fact that TZ is multiplicative. Writing TZq-i 
have 



1=0 



1_ m-l 

m m 



1_ m-l 

m m 



1 »n— 1 



m m 



1 m-l 



m m 



{np, 



1_ m-l 

m m 



(6) 



we 



(npo)(nq-i)(npi) ■ ■ ■ (7^g_l)(7^p, 



Ap. 



□ 



Algorithmic description of T 

We now present an algorithm that uses samples of fi to get a sample from Tfi. This algorithm 
is an interpretation of Proposition 16. 2[ though it is also possible to use it to define T, and 
derive Proposition 16.21 from it analogously to Proposition 16.11 
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To sample T/x we consider the Markov chain {Sn, Qn}n>o with state space {0, 1} x and 
transition probabihties 

F{£n = j, Qn = P\£n-1 = i, Qn-l) = {Tlij^i){p). (7) 

Lemma 6.5. If Sq = and r > is minimal such that £r = 0, then Qi - ■ ■ Qr has the law of 

Proof. This is almost identical to Proposition 16. 1[ The term T^-ooA* corresponds to r = 1, and 
the term (7^olAi)(7^ll/i)"(7^lo/i) to r = n + 2. □ 

Since TZfi is linear, the algorithm above can be broken into to the following steps 

• Start with So = 0- 

• For i = 1,2, . . . sequentially pick ^ n, and pick Q^) with distribution 

P(£:„ = j, Qn = p\Sn-l = i, Qn-l) = {TlijPn){p)- 

• At the first time r > 1 so that £j. = 0, stop and return the pattern Qi - ■ ■ Qr- 

7 Properties of T and the sequence T^/i 

In this section we study the evolution of a measure /i on patterns under iterated ascension. 

Proposition 7.1. For a probability measure fi on patterns, the moments of the pattern lengths 
with respect to Tfi satisfy 

r^i\p\) < mA^i\p\), r/x(|p|2) < ^^Aix{\p\y + mAfii\p\^). 

[m — l)iJ,[D) 

Proof. Because length is subadditive {\pq\ < \p\ + \q\), implies that the first and second 
moment of the length of a sample from T/i are bounded above by the first and second moment 
of 

where TZij refer to induced measures on length, and the above formula takes values in the 
semi-group algebra of Z_|_. Let fij{z) = '^n{J^ijlj){n)z'^ denote the generating functions for the 
measures Tiijii. The generating function of the measure 6 is 

/ = V^oo + (l - V^ii) Vio- 
The first and second moments of 9 are given by /'(I) and /'(I) + f"{l) respectively. 
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We have formula (jlj) for the total mass: 
It follows that 

/'(I) = ^U^) + ^',,{1) + (m - l){ip[o{l) + Vuil)) = mAfxi\p\). 

The second moment is computed in a similar fashion, using second derivatives. With the 
notation Kij = (^^^(1) = Rijfi{\p\), a straightforward computation gives the second moment 
/"(!) + f(l) of^as 



2m T\ \ A f\ \2\ 2m (m^/i(|p|)) 



2 



/ ('^10 + i^u){i^oi + {m- 1)ku) + mAiJ,{\p\ ) < — — h m^/i(|p| ). □ 

Lemma 7.2. We have the following (where D G Qk is the set of patterns containing q-i): 
(a) For any x ^ D we have Tfi{x) > fi{x). 
(h) IffiiD) > 0, then Tfx{D) < fi{D). 

(c) If fi is supported on Qk~i U {qk} then so is Tfi, and fi{qk) = T^i{qk)- 

Proof. We refer to the algorithmic descrition of T/i and Lemma I6.3[ If the first pattern Pi 
chosen from is in D'^, then = and Qi = Pi is also the output of the procedure. Claim 
(a) follows. 

For part (b), assume that fi{p) > for some p E D. Consider TZp as given by Lemma [6.31 
Taking only the ;^'7^oo terms in the product shows that TZooP assigns some positive probability 
6 > m^^ to the pattern p with all appearances of g_i deleted. This pattern is in D'^. Thus 
Ti2{D') > fi{D') +6fi{p). 

For part (c), note that for such measures, the only way to get qk in the ascension algorithm 
is if the first pattern selected from /i is g^, in which case it is also the output. □ 

Lemma 7.3. // the measure sequence {T^n} is tight, then its limit exists and is supported on 

Proof. Since the measures are tight they have subsequential limits that are probability mea- 
sures. Let z/ be such a limit. T is continuous, so there are k so that z/ and T^n are close enough 
to have 

T^+^li{D) - Tu{D) < e. 
But again since i/ is a limit point there is £ > A; so that 

v{D) - T^i2{D) < e. 
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Summing the last two formulae and using that T^fi{D) < T^^^fi{D) we get 

iy{D) - Tu{D) < 2e. 

Since e is arbitrary, Tv{D) = ^{D). By Lemma [7.2( b). this implies that z^(-D) = 0. 

Now, T^fi{x) is monotone in k for x ^ D, so all subsequential limits are equal. □ 

Lemma 7.4. Consider an entropy-tight set of measures {/ij} on Qk, such that < M for 

some M. Assume further that for patterns p G IJj supp(yUi) we have H{^) = o{\p\). Then {Jli} 
is also entropy-tight. 

Proof. Denote K = {g E 9Jlm,fc : \g\ < and consider a measure /i on Q^. By definition, 

p 

For compactness, denote h{x) = — xlogx, and note that h is subadditive. We have 

\g\>e \g\>i V 

Since p is supported on group elements of length at most the above equals 

\g\>e\p\>e \p\>e g 

However, since p is a probability measure, for any a > we have 

Yh[ap{g)) = - J^(loga + logp{g))ap{g) = h{a) + aH{p), 
a 9 

and therefore 

Hk^ji) < KM) + E ^'(p)H{p)■ (8) 

lpl>£ \p\>t 

Now fix e. For I sufficiently large, the first sum in (IHl) is uniformly small for all ^ in our entropy- 
tight family. Also by assumption, if ^ is large enough then either = or < e\p\. 
Thus for large enough £ we have 

HK^{-fi) <£+Y <^ + Me. 

Since e is arbitrary, this implies that {jli} is entropy-tight as claimed. □ 
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8 Preliminary results for entropy of patterns 



Here we establish some useful facts about entropy of /i for some measures /i on patterns. We 
first recover the following result. 



Theorem 8.1 ( Bartholdi. Kaimanovich. and NekrashevychI (j2008l )). Let fi be supported on pat- 



terns of length 1 in Qq. Then H^{jl) = 0, or equivalently, the entropy H{jl*"') = o{n). 

We present what is essentially the original proof, written in the language of this paper. 

Proof. Lemma IT^ c) shows that T'^/i is supported on {go} U Q_i = {qo,q-i,^}- Thus the 
sequence T^/I is tight and entropy-tight. Lemma [7.31 shows that the u = lim T'^/i exists and is 
supported on {qo,^}. Thus supp(z7) is contained in a finite group, and Hooij^) = 0. The result 
now follows by Theorem 14. 2[ □ 

Next, we show that evaluation of patterns on Qq have entropy which is sublinear in their 
length. We need the following simple lemma. 

Lemma 8.2 (Monotonicity of pattern entropy). If p and r are patterns, then 

H{pf) > max {H{p), H{r)) . 

Proof, pr is a convex combination of measures of the form pg where g is chosen according to Q. 
All these measures have the same entropy H{f). Entropy is a concave function, so by Jensen's 
inequahty 

H{pr) > H{p). 

In the same way, H{pf) > H{r). □ 
Lemma 8.3. There is a function f{n) = o{n) such that for any pattern p ^ Qo 

Hip) < /(bl). 

Proof. Let /i be the uniform measure on {g_i, go}, and fin = /x*". By concavity of entropy and 
Jensen's inequality, since Jhi = J2 l^n{j>)p, we have 

> J2 f^n{p)H{p) > J2 f^n{p)H{p) 
p6So P&Ai 



where Ai consists of all patterns of length greater than i. By Lemma 18. 2[ the entropy of each 
word of length n is greater than the entropy of each word of length n — 1 (since patterns in Qq 
are just alternating go's and g_i's). It follows that 
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where Q\ can be either of the two patterns of length ^. 

The length of a sample of /i„ is the number of runs in the word Qi - ■ ■ Qn with i.i.d. letters. 
Since each letter starts a new run with probability 1/2, the length is binomial, and symmetric 
about n/2. Thus with n = 2i we have finite) ^ 1/2- We find 

H{Q*) < 2H(JI^) = o(£). □ 

We can also make some conclusions about linear automaton groups. It is not hard to see that 
the combined supports £.q,2^i of the evaluated patterns %,qi generate a bounded automaton 
group. Certain walks on this group also have zero asymptotic entropy. 

Lemma 8.4. Any measure /i on Qi supported on the patterns {qo,qi,^} satisfies HooiJl) = 0. 
Proof. Consider the subgroup of consisting of automorphisms of the form 

9 = {{9,9i, ■ ■ ■,9m-i))p, 

where O.p = 0. Note that the support of JI is contained in this subgroup. Consider a random 
walk Xk with step distribution /I, and write 

Xfc = ((Xfc,n(i),...,n(m-i)))pfc. 

The key observation is that the distribution of each Yi{i) is fi' where p' = TZnp. It follows that 
for each i > the process (Yk{i), > 0) is a random walk on Tlm,o with step distribution /i', 
where /i' is supported on {g_i,go5 0}- By Theorem 18.11 this random walk has zero asymptotic 
entropy. On the other hand, X is determined by p and the Fj's, so 

m—l 

H{X,)<H{pk) + Y,H{Y,{t)). 

i=l 

Thus ifoo(p) < (?Ti — l)Hoo{p') = 0, as required. □ 

9 The linear-activity case and the main theorem 

The key step in the proof of the main theorem is the following proposition. It is based on 
results from the previous sections. 

Proposition 9.1. Let p be a measure on Qo U {gi} with < oo. Then the sequence 

Pk = T^p is both tight and entropy-tight. 

The tightness of the sequence depends on a certain contracting property of the ascension 
operator T for patterns on Qq. Proposition 17.11 gives bounds on moments of T p in terms of 
moments of Ap. The next lemma bounds the moments of Ap. 
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Lemma 9.2 (Length of legacy). For a probability measure fi on {qi} U Qo, we have 



AK\p\) < 1 + ^^M, AM') < ^^^^^MbP) + 3/i(H). 



Proof, /i is a convex combination of delta measures on a single patterns. Since Afi is linear, and 
the bounds are all affine in /i, it suffices to prove the claim for delta measures fi = p supported 
on a single pattern p E {qi} U Qq. 

If p G {0,gi,g_i}, then Afi is supported on patterns of length at most 1, and the claim is 
trivial. 

Otherwise, fJ^ = p for some p G Qo- Let t > be the number of times go appears in p, so that 
\p\ >2t — 1. By ([6]), a sample of Afi is given hj Xi ■ ■ ■ Xt where Xi are i.i.d. with distribution 
Aqo = ^qo + ^^^Q-i- Each run of go's or g_i's reduces to a single letter, so the length R of 
Xi ■ ■ ■ Xt is given by the number of runs. 

Let Ni be the indicator of the event that Xj starts a new run, so that R = Ni + . . . + Nj. 
Then Xi = 1 and for 1 < i < t we have EXj = . Now, 

^ 2(m — 1) , , m — 1 , , 
ER = 1 + ^ — ^ t - 1 < 1 + ^ b • 

For the second moment, since Xj is independent of Nj unless |i — j| < 1, for t > 1 we get 

\2 T\2 



^ 9 ^ 4(m — 1) , ,9 (m — 1) , ,9 
ER^ = y ENiNj < ^ t - 1 2 + 5t - 6 < ^ b + 3 b • 

i,j<t 



Here, 5t — 6 is the number of pairs that are dependent or involve Xi. The right hand side is a 
bound for the E(i?V) 

even for J = 0. □ 
Combining Lemma 19.21 with Proposition 17.11 gives the following contraction property of T: 
Corollary 9.3 (Contraction). For any probability measure fi on QqU {gi} we have 

TM) < ^/.(bl) + m, TMbD < -^M\P\) + mf + ^^^^^/i(br)- 

Proof of Proposition l9A[ Any non-negative sequence satisfying Xk+i < c + axk with a G [0, 1) 
must be bounded. By the first claim of Lemma [9T3l the sequence fJ>k{\p\) is bounded, which 
implies tightness. 

Similarly, if > is a non-increasing sequence, then for any sequence > the inequalities 
Xk+i < c/sk + OiXk imply that SkXk is bounded. Set Sk = fikiD), which is non-increasing by 
Lemma 17.2( b). With the second claim of Lemma 19.31 and fik{\p\) < c, this implies that the 
sequence Hki^D) ■ /ifc(bn is also bounded. 
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For entropy-tightness, Since /i^ is supported within {qo,qi,^} U D, it suffices to show that 
the contribution to its entropy from the set D converges to 0. Let Uk denote fik conditioned to 
the set D. The contribution to H{^k) from D is given by 

^ -fikiP) log/ifc(P) = ekH{vk) - ek\og£k- (9) 

P&D 

We have 

..(bp) < < ^. 

Sk £k 

However, note that z/^ is supported on patterns of alternating go's and g_i's, so we have 

Hiuk) < 1 + Hih) (10) 

where Uk is the induced measure on lengths. Lemma [3.11 bounds entropy in terms of variance, 
and hence in terms of second moment. In our case it gives 

H{h) < 1 + C + ( log(C/4))+ <C' + I log^l. 

Combining this with ([9]), ( fTOl) . and Lemma [7.31 which shows that Sk 0, completes the proof. 

□ 

We are ready to prove the following, which implies Theorem [H 

Theorem 9.4. Let fi be a probability measure on patterns in Qq U {qi} with < oo. The 

random walk on the linear- activity mother group with step distribution JI has zero asymptotic 
entropy. 

Proof. By Theorem 14.21 it suffices to show that the sequence T'^JI is entropy-tight and converges 
weakly to a measure with zero asymptotic entropy. 

Consider ffist the sequence T^fi of measures on patterns. Proposition 19.11 shows this se- 
quence is tight. Lemmas 17.21 and 17.31 imply that z/ = lim T^/i exists and is supported on 
{gi,go)0}- Finally, by Lemma [8^ evaluation of measures supported on {qi,qo,^} have zero 
entropy, so HooiT') = 0. 

It remains to show that T^Ji is entropy-tight. This follows from Lemma [731 so we verify its 
conditions. Entropy-tightness and uniformly bounded expected length for T^/i are proved in 
Proposition 19. 1[ The support of any T^fi is contained in {qi} U Qq. By Lemma [8^3] the entropy 
ofpG {gJU Qo is o(b|). □ 

Acknowledgements. We thank V. Nekrashevich for bringing the example of the long-range 
group to our attention. 
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